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The emergence and rapid spread of New Delhi MetaIlo-beta-lactamase-1 (NDM-1) -producing Klebsiella 
pneumoniae strains has caused a great concern worldwide. To better understand the mechanisms 
underlying environmental adaptation of those highly drug-resistant K. pneumoniae strains, we took 
advantage of the China's Shenzhou 10 spacecraft mission to conduct comparative genomic and 
transcriptomic analysis of a NDM-1 K. pneumoniae strain (ATCC BAA-2146) being cultivated under 
different conditions. The samples were recovered from semisolid medium placed on the ground (D strain), 
in simulated space condition (M strain), or in Shenzhou 10 spacecraft (T strain) for analysis. Our data 
revealed multiple variations underlying pathogen adaptation into different environments in terms of 
changes in morphology, H2O2 tolerance and biofilm formation ability, genomic stability and regulation of 
metabolic pathways. Additionally, we found a few non-coding RNAs to be differentially regulated. The 
results are helpful for better understanding the adaptive mechanisms of drug-resistant bacterial pathogens. 



f ^^/efo!e//cip«eMmo«Meis a highly important bacterial pathogen that causes a wide range of community- and 
fC hospital- acquired infections, such as pneumonia, urinary tract infections, intra-abdominal infections and 
■ ^ respiratory tract infections' "^. The emergence and rapid spread of carbapenem-resistant K. pneumoniae 
isolates such as the New Delhi Metallo-beta-lactamase-1 (NDM-l)-producing strains has caused great concern 
worldwide' The NDM-1, an Ambler class B metallo-P-lactamase (MBL), is capable of hydrolyzing all (B-lactams 
(including carbapenems) except monobactams. It was first identified in K. pneumoniae and Escherichia coli in 
India in 2008. To date, NDM-1 producing pathogens have been reported in more than 40 countries'""*. The 
successful persistence and transmission of this "superbug" remain largely unknown, but could be partly due to the 
ability of this pathogen to sense and react to environmental and host stress signals, which allow it to persist and 
disseminate in harsh conditions such as in various medical settings and inside the human host. Thus, under- 
standing the ability of those pathogens to adapt to various stressors they encountered during environmental 
persistence could facilitate the comprehension of their pathobiology. In addition, identification of specific 
genomic variations and transcriptome patterns critical for drug resistance and environmental adaptation of 
the drug-resistant pathogenic bacteria is important for the development of more effective pathogen control 
strategies. 

During spaceflight, a variety of physiological stressors associated with the space environment and spacecraft 
conditions could potentially contribute to detrimental alterations in the human immune system. At the same 
time, the bacteria introduced into the extraterrestrial environments by the space crew members inhabiting the 
stressful airspace environments might develop resistance traits which could be important to spaceflight missions 
and the general public medicine. Indeed, the recurrent isolation of various extremotolerant bacteria from space- 
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Figure 1 | Morphology of the K. pneumoniae strains, (a) Scanning electron micrographs of the K. pneumoniae strains, (b) Colonies of the strains on LB 
agar plates. Scale bar, 5 |j.m. 



crafts emphasizes the possibility of those microorganisms gaining 
access to spacecrafts and being transferred to extraterrestrial envir- 
onments'^ Furthermore, it has been demonstrated that following 
spaceflight the gene expression pattern of Salmonella enterica sero- 
type Typhimurium changed and the virulence of those bacteria 
increased'^. However, the microbial mechanisms of resistance in 
those pathogenic bacteria during spaceflight remained largely 
unknown. The extreme conditions encountered by the microorgan- 
isms in air space could facilitate the occurrence of the environmental 
adaptation-associated genomic variations and transcriptomic 
changes in them, and that information could be useful for taking 
pre-emptive measures to manage medically important drug- resistant 
pathogens on earth. The Shenzhou-10 spacecraft, which was 
launched in June 11, 2013 and returned to earth after completion 
of its 15-day spaceflight, is China's fifth manned space mission and 
its longest so far, thus it provided us a unique opportunity to explore 
the underlying mechanisms important for bacterial adaptation to 
extreme environments. 

In this study, we conducted comparative whole genome sequen- 
cing and RNA sequencing (RNA-Seq) analysis of a NDM-1- pro- 
ducing K. pneumoniae strain (ATCC BAA- 2146). The K. 
pneumoniae strains were cultivated in semisolid medium and the 
culture placed in the following conditions: on the ground (D strain), 
in simulated space condition (M strain), and in the Shenzhou 10 
spacecraft (T strain). Comparative genomic analysis revealed 3 and 
2 strain-specific mutations in M and T strains, respectively. The 
comparative transcriptomic analysis revealed different transcrip- 
tome patterns among the strains, especially with respect to the regu- 
lation of multiple metabolic pathways. Additionally, we found a few 
non-coding RNAs (ncRNAs) to be differentially regulated. 
Phenotype comparisons showed that the T strain exhibited elongated 
forms, reduced hydrogen peroxide (H2O2) tolerance and increased 
biofilm formation ability. Our results provide new insights into the 
environmental adaptability of drug- resistant K. pneumoniae. These 
findings could contribute to understanding the persistence and wide- 
spread of those highly drug-resistant pathogenic bacteria. 

Results 

Phenotypic characteristics of K. pneumoniae strains. The K. pneu- 
moniae ATCC BAA-2146 was an ESBL-positive and NDM-1- 
producing strain. According to multilocus sequence typing 
(MLST), this strain belonged to sequence type 11 (STll). Drug 
susceptibility testing results showed that the reference strain ATCC 
BAA-2146 was resistant to all antibiotics tested, and the drug 
resistance profiles of the recovered D, M and T strains remained 
unchanged (Supplementary Table SI). No notable differences in 
colony morphology of the strains were observed under light 



microscopy. Field emission scanning electron microscopy (SEM) 
was further used to monitor single cell morphology of the strains 
and the results showed that the bacterial cell walls of all strains were 
intact. But interestingly, some of the cells of the T strain turned into 
elongated forms and adhered to each other under SEM (Fig. 1 ). In the 
H2O2 sensitivity assay, the 1 mM and 5 mM H2O2 exposure resulted 
in slightly reduced survival for the T strain in comparison with the 
BAA-2146 control strain (P < 0.001), whereas, the D and M strains 
showed no obvious differences in sensitivity to H2O2 when compared 
to the BAA-2146 strain (Fig. 2a). The ability of K. pneumoniae to 
form biofUms is thought to be an important phenotype with respect 
to adaptability and virulence traits such as host colonization, 
antibiotic resistance, and environmental persistence, etc."". We 
therefore performed biofilm assays to determine the biofilm 
formation abilities of the strains, and the results showed that the T 
strain showed enhanced biofilm- forming capacity compared to the 
BAA-2146 strain (P < 0.001) while the D stram and the M strain 
showed similar biofUm-forming capacity with the reference strain 
BAA-2146 (Fig. 2b). The fitness of the K. pneumoniae strains was 
further assessed by determining the growth rates of the strains in a 
nutrient LB medium as well as in chemically defined media (CDM), 
as this parameter also could contribute to environmental adaptabi- 
lity. AU strains showed similar growth curves in LB medium (Fig. 3a) 
and in CDM (Fig. 3b). Furthermore, the growth curves were also 
similar when the bacteria were cultivated in LB medium at different 
temperatures including 30°C, 37°C and 40°C (Fig. 3c, 3a and 3d). To 
better understand the phenotypic changes in the strains, we also 
performed carbon source utilization assays and chemical 
sensitivity assays using the 96-well Biolog GEN III MicroPlate. 
Among the 71 carbon source utilization assays, the a-D-Lactose 
utilization ability was found to be defective in all three strains 
(including the D, M and T strains) as compared to the reference 
strain BAA-2146. Interestingly, the M strain was shown to gain the 
ability to use D-Mannose. Among the 23 chemical sensitivity assays, 
no obvious changes were observed for all three strains compared to 
the reference strain (Supplementary Table S2). 

Whole genome sequencing statistics. The basic whole genome 
sequencing statistics were shown in Table 1. The sequencing depth 
ranged between 127X and 128X and read mapping results reported 
nearly complete genome coverage (—99.99%) for all three strains. 
Fig. 4 showed the circular representation of the genome features 
including the COG annotated coding sequences, KEGG enzymes, 
RNA genes, GC content, GC skew, strain-specific SNPs, etc. As 
shown in Supplementary Fig. SI, aU three samples had similar 
mapping coverage across the reference genome and the coverage 
was higher for the regions closer to the origin of replication. 
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Figure 2 | Oxidative stress tolerance and biofllm formation ability of K. pneumoniae strains, (a) Survival of K. pneumoniae strains in oxidative 
stressed environment. The ATCC BAA-2146, D, M and T strains were incubated with different concentrations of H2O2 in sodium phosphate 
buffer, and then tested for their survival on LB plates. Growth is expressed as relative survival after 2 h incubation with H2O2. (b) Biofilm assays of 
K. pneumoniae strains. OD595 readings were measured for each strain to determine the amount of biofilm formed. Data represent one of three 
independent experiments. 
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Figure 3 | Growth curves of the K. pneumoniae strains in LB medium (a, c and d) or CDM (b). Overnight-grown bacteria were diluted 100 folds in 
fresh LB medium or CDM and were cultivated at designated temperature with shaking (180 rpm). ODgoo readings were measured at designated time 
points for each strain. The data represent one of two independent experiments performed in triplicate, with s.d. indicated by error bars. 
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Table 1 | Statistics of 


whole genome sequencing 




Sample 


D strain 


M strain 


T strain 


IVUW rcuQ 5TaTI5TIC5 








Total reads 


Am 1119 
^yj 1 1 1 1 z 


oooy 


/im Q QQO 


Total base pairs 


799 


797 


/ Z/ 


Sequencing depth (X) 


127 


128 


1 ZO 


Genome coverage [%] 


99.99 


99.99 


vv. vy 


Assembly statistics 








Chromosome size 


5,674,681 


5,671,078 


5,671,920 


No. of scaffolds 


72 


73 


66 


Largest scaffold length 


840,095 


840,520 


839,882 


N50 scaffold length 


236,010 


236,344 


288,168 


G+C content (%) 


57.0 


57.0 


57.0 



probably because of a higher speed of the DNA replication than that 
of cellular doubling during exponential growth. 

Identification of the mutations in D, M and T strains compared to 
reference strain ATCC BAA-2146. A total of 34 mutations 
(excluding the synonymous mutations) were identified in all 
strains (Supplementary Tables S3), among which, 25 mutations 
were shared by all three strains (Supplementary Tables S4). No 
strain-specific mutation was identified in the D strain, while 3 and 
2 strain-specific mutations were identified in the T and M strains, 
respectively. In addition, a few mutations were shared by T and M, T 
and D, or M and D strains (Table 2). PGR and sequencing analysis 
confirmed that these mutations were not sequencing or assembly 
errors. 

Drug resistance determinants of K. pneumoniae strains. Electronic 
PGR was conducted for ATGG BAA-2146, D, M and T strains to 
analyze drug resistance determinants conferring resistance to 



carbapenems, folate pathway inhibitors, fluoroquinolones, aminog- 
lycosides, etc. The drug resistance-associated genes and gene 
mutations detected in the D, M and T strains were identical to 
those of the reference strain ATGG BAA-2146 (Supplementary 
Table S5). 

RNA-Seq mapping statistics. Approximately 96% of the sequencing 
reads could be mapped to ATGG BAA-2146 (GP006659) reference 
genome (Table 3). The uniquely mapped reads for D, M, and T were 
96.0%, 95.9% and 96.3% respectively. The number of reads mapped 
each gene ranged from 1 to 449,669 with a median of 206, 209 and 
214 for D, M and T, respectively. In addition, the RPKM for D, M and 
T was 48.8, 49.5 and 52.3 for D, M and T respectively. The 
distribution of the number of mapped reads and RPKM values 
across all three samples was displayed in Supplementary Fig. S2. 
The expression of total genes and the differentially expressed genes 
identified among D, M and T strains (fold changes >2) were shown 
in Fig. 5. 

Comparative transcriptomic analysis. By using the KEGG ortho- 
logy based annotation system to identify metabolic pathways, we 
identified extensive changes in the transcriptomes of the M and T 
strains in comparison to the D strain. Expression of the differentially 
expressed genes identified among D, M and T strains (including: T 
strain vs. D strain, M strain vs. D strain, and T strain vs. M strain) 
were shown in Supplementary Tables S6-S8. Gompared to the D 
strain transcriptome, the T strain transcriptome was characterized 
by regulation of a number of genes involved in fatty acid degradation 
(P=0.0464189), microbial metabolism in diverse environments 
(P=0.0464189), and phenylalanine metabolism (P=0.0464189). 
Among those three categories, the majority of the genes involved 
in fatty acid degradation and phenylalanine metabolism were 
down-regulated, while the genes involved in microbial metabolism 
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Figure 4 | Circular representation of tlie genome features. Genome sequences (ring 1), COG Annotated coding sequences (rings 2 -I- 3), KEGG enzyme 
(ring 4), RNA genes (ring 5: red, rRNA; blue, tRNA), GC content (rings 6), GC slcew (ring 7), strain-specific SNPs (ring 8) are shown. AU strain-specific 
SNPs from D (yellow), M (red), and T (blue) are shown on ring 8. Very short features were enlarged to enhance visibility. Clustered genes and 
SNPs, such as several rRNA genes, may appear as one line due to space limitations. The image was created by using the software Circos. 
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Table 2 | Comparative analysis of SNPs and Indels identified in D, M, and T strains. Synonymous SNPs were excluded 



Mutation position 



Gene name 



Product 



Base mutation AA mutation Type of mutation 



SNPs and Indels specific to T strain 

G1539668C Kpn2146_1513 

T2546875G Kpn2 1 46_2552/Kpn2 1 46_2553 



D-3-pfiosphoglycerate G20C 

dehydrogenase 

4-carboxymuconolactone T-327G 

decarboxylase/Hypothetical 

protein 

small subunit ribosomal RNA G1400A 



G4946124A Kpn2146_5015 
SNPs and Indels specific to M strain 

T3252757A Kpn2 1 46_3273/Kpn2 1 46_3274 
C3252759T Kpn2 1 46_3273/Kpn2 1 46_3274 
SNPs and Indels specific to D strain 

None 

Non-synonymous SNPs and Indels shared by T strain and M strain and not present in D strain 



RtT RNA/tRNA-Tyr(GTA) 
RtT RNA/tRNA-Tyr(GTA) 



T-39A 
C-37T 



G2613880A Kpn2146_2620/Kpn2146_2621 Ambler Class A beta lactamase 

SHV-1 1 /Hypothetical protein 

SNPs and Indels shared by T strain and D strain and not present in M strain 

C5323238G Kpn2146_5387/Kpn2146_5388 Transcriptional regulator RutR of 

pyrimidine catabolism (TetR 
familyj/Putative acetyltransferase 

SNPs and Indels shared by M strain and D strain and not present in T strain 



G2610928T Kpn2146_2617 
G26 1 2995C Kpn2 1 46_26 1 9/Kpn2 1 46_2620 



Hypothetical protein 
DeoR family transcriptional 
regulator probably involved in 
glycerate glycolaldehyde 
metabolism/Ambler Class A beta 
lactamase SHV-1 1 



G-274A 



C-289G 



Gn2T 
G-IC 



R7P 



L38I 



Non-synonymous 
Intergenic regions 



Intergenic regions 
Intergenic regions 



Intergenic regions 
Intergenic regions 



Non-synonymous 
Intergenic regions 



in diverse environments include many up-regulated and many down- 
regulated genes (Supplementary Table S6). Compared to the D strain 
transcriptome, the M strain transcriptome showed that many genes 
associated with tyrosine metabolism (P<0.001), degradation of 
aromatic compounds (P=0.003804639), and microbial metabo- 
lism in diverse environments (P= 0.005292205) were regulated 
(Supplementary Table S7), and the majority of those genes were 
up-regulated. Genes altered in their expression common to both M 
strain and T strain transcriptomes included a variety of genes 
involved in microbial metabolism in diverse environments. In 
comparison to the M strain, the T strain transcriptome was further 
characterized by regulation of a number of genes involved in citrate 
cycle (P=0.0179529) and phenylalanine metabolism (P=0.0404427). 
In addition, many more genes involved in microbial metabolism in 
diverse environments (P= 0.0404427) were regulated in the T strain 
than in the M strain. Additionally, we found a few ncRNAs to be 
differentially regulated in the T and M strains as compared to the D 
strain. Some of those ncRNAs (including GknZ RNA activator of 
glmS mRNA and RtT RNA) were similarly regulated in both M and T 
strains. But interestingly, sroC RNA was down- regulated in the T 
strain but up-regulated in the M strain (Table 4). The number of 



mapped reads and estimation of gene expression (RPKM) values 
between the D, M and T strains were displayed in a boxplot as 
shown in Supplementary Fig. S2. Ten genes were selected for 
quantitative RT-PCR analysis in parallel with the RNA-Seq 
analysis, and the results validated that gene expression fold changes 
from RNA-Seq analysis for specific genes were well correlated with 
transcript levels measured using real-time quantitative RT-PCR 
(qRT-PCR) analysis, supporting that RNA-Seq provides reliable 
quantitative estimates of transcript levels (Supplementary Tables S9 
and SIO). The Orthologus Groups (COG) enrichment analysis was 
also determined to show the distribution of differentially expressed 
genes (DEGs) in COG functional categories (Fig. 6). A large number 
of the genes that were differentially expressed in the T strain vs. D 
strain (T/D) play putative roles in the following two COG categories: 
amino acid transport and metabolism (P=0.01814), and carbo- 
hydrate transport and metabolism (P=0.04659). In addition, many 
genes that were differentially expressed in the M strain vs. D strain 
(M/D) belonged to the following three COG categories: Cell motility. 
Secondary metabolites biosynthesis (P=0.004304), transport and 
catabolism (P= 0.007086), and Inorganic ion transport and metabo- 
lism (P=0.0346). 



Table 3 | Summary of RNA 


sequencing data 














D 




M 




T 




Sample name 


Reads number 


Percentage 


Reads number 


Percentage 


Reads number 


Percentage 


Total reads 


13,367,754 


100.0% 


13,887,820 


1 00.0% 


13,284,972 


1 00.0% 


Total base pairs 


1,203,097,860 


100.0% 


1,249,903,800 


1 00.0% 


1,195,647,480 


1 00.0% 


Total mapped reads 


12,831,916 


96.0% 


13,321,176 


95.9% 


12,796,903 


96.3% 


Perfect match 


10,367,952 


77.6% 


10,822,875 


77.9% 


10,284,372 


77.4% 


sSbp mismatch 


2,463,964 


18.4% 


2,498,301 


1 8.0% 


2,512,531 


18.9% 


Unique match 


12,829,558 


96.0% 


13,317,899 


95.9% 


12,793,482 


96.3% 


Multi-position match 


2,358 


0.0% 


3,277 


0.0% 


3,421 


0.0% 


Total unmapped reads 


535,838 


4.0% 


566,644 


4. 1 % 


488,069 


3.7% 


Reads that aligned torRNA 


10,366 


0.2% 


1 1 ,075 


0.2% 


4,232 


0.1% 
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Figure 5 | Expression of total genes (a) and the differentially expressed genes (b) identified among D, M and T strains (Fold changes >2). The heatmap 
was generated from hierarchical cluster analysis of genes. The RPKM values were mean-centered and normalized across samples, with each row 
representing a different gene. 



Discussion 

The explanation for the epidemiological success of highly drug-res- 
istant bacterial pathogens, such as the NDM-1 -producing K. pneu- 
moniae strains, is complex. In general, the antibiotic resistance 
phenotype of bacteria is attributed to their capacity to acquire and 
express genes associated with a wide range of antibiotic resistance 
functions, as well as their intrinsic properties such as porins and gene 
mutations, etc. In this study, the reference strain ATCC BAA-2146 
was shown to possess a large number of previously reported genes 
associated with resistance to P-lactams, fluoroquinolones, aminogly- 
cosides and folate pathway inhibitors'^". Among the more recently 
reported carbapenemases genes (such as bluiMv, fo/aviM> ^'andm-i. 
fc/aKpc. and foZfloxA-48> etc.), only NDM-1 was detected in ATCC 
BAA-2146. We also identified a gyrA mutation T247A (Ser83Ile) 
which was shown to be associated with fluoroquinolone resistance". 
In this study, the D, M and T strains all remained their drug resist- 
ance phenotype and maintained all those drug resistance-associated 
genes detected in the reference strain ATCC BAA-2146, suggesting 
that this highly drug-resistant K. pneumoniae strain did not lose 
resistance to those drugs while grown in extreme conditions. Based 
on MLST genotyping analysis, ATCC BAA-2146 belonged to STll, 
which was reported to be the most dominant clone among the car- 
bapenem- resistant K. pneumoniae isolates in China^". Thus this 
drug-resistant K. pneumoniae strain could be intrinsically highly 
transmissible. 

Morphological analysis revealed that some cells of the T strain were 
elongated and adhered to each other under SEM, suggesting that 
certain genes involved in maintaining bacterial morphology and 
adherence ability could be affected or regulated. Normally, the fimc- 
tion and morphology of the bacteria are maintained by well-coordi- 
nated equilibrium of their fusion and fission activities. Therefore, the 



explanations for the mechanism underlying the formation of elon- 
gated bacteria could be an enhanced fusion process, a blocked fission 
process, or a combined action of the two. Since we did not identify 
mutations in genes directly involved in bacterial fission or fusion 
function in the T strain, we thus hypothesize that certain mutations 
which occurred in the intergenic regions or ncRNAs could play reg- 
ulatory roles on genes associated with these morphological changes in 
the T strain. We also predict that the increased bacterial fusion and 
adherence capabilities might be beneficial for the pathogen to form 
biofllm under stress conditions in order to promote bacterial survival. 
The elongated forms of the bacteria have previously been observed in 
bacteria being treated by certain antibiotics. For example, in one 
study, the scanning electron microscopy analysis demonstrated that 
Pseudomonas aeruginosa became elongated after exposure to cipro- 
floxacin^'. Another study also showed that most of the P. aeruginosa 
cells turned into elongated forms and adhere to each other when 
exposed to naUdrxic acid^"'. Thus, this morphological change could 
be a typical response of pathogenic bacteria towards various stressors. 

Bacterial cells are constantly challenged by various environmental 
stressors from their natural habitats. Similar to many pathogens, K. 
pneumoniae faces several challenges during infection and coloniza- 
tion of the human body. Among these, the ability of the pathogen to 
tolerate oxidative stress is critical for their survival since the trans- 
ition from aerobic to microaerophilic conditions or the transition 
from a microaerophilic to oxidative stress environment is frequently 
encountered by the pathogen during infection. Oxidative stress on 
aerobic bacteria is mainly mediated by partially reduced oxygen 
species, or reactive oxygen species, most notably superoxide and 
H2O2, which are by-products of aerobic metabolism. These reactive 
oxygen species can cause damage to DNA, proteins and membranes. 
As a result, all aerobic bacteria possess various mechanisms to scav- 
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Non-coding RNAs differentially expressed in different K. pneumoniae strains 
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enge superoxide and H2O2, thus protecting the cells from being 
damaged by these reactive oxygen species. Thus, the response of 
drug-resistant bacteria under oxidative stress conditions is of par- 
ticular interest. Our study revealed that the T strain exhibited a 
slightly reduced tolerance toward H2O2, this could be partly 
explained by the decreased expression of the universal stress res- 
ponse gene as well as a few ncRNAs such as sroC RNA, 6S/SsrS 
RNA, and the bacterial small signal recognition particle RNA in this 
strain, since those genes and ncRNAs were shown to be involved in 
stress tolerance or bacterial fitness^^^"^'. Interestingly, it was demon- 
strated that S. Typhimurium inside macrophages led to repression of 
the expression of the sroC and 6S/SsrS RNA genes, suggesting that 
those ncRNAs may also direct pathogen adaptation to a non-prolif- 
erative state inside the host cell'''. Taken together, we hypothesize 
that those ncRNAs could also play critical roles during envir- 
onmental adaptation of K. pneumoniae. 

The bacterial biofikn consists of an aggregate of cells contained 
within a matrix of surface polysaccharides, proteins and DNA. The 
ability to produce a biofUm results in enhanced resistance to host 
defense factors, antimicrobials, as well as various environmental stres- 
sors, thus it has been increasingly recognized as an important viru- 
lence property. An observation by Wu et al. showed that certain K. 
pneumoniae strains such as hvKP strains produced more biofUm than 
cKP strains, suggesting that biofilm formation may be a 
contributing factor to the increased virulence of the hvKP strains^**. 
In this study, though the T strain showed slightly decreased tolerance 
towards H2O2, it exhibited increased biofilm production. The similar 
growth rates of the reference strain ATCC BAA-2146 as well as the D, 
M and T strains in LB medium and CDM ruled out the possibility that 
the increased biofilm production in the T strain was due to a change in 
growth. The mechanism underlying the increase biofilm formation in 
the T strain remains currently unclear, but could be associated with 



increased expression of certain regulatory genes involved in surface 
polysaccharide formation as suggested by the increased adherence 
between bacterial cells under SEM. We also noticed that in the T 
strain a large number of up-regulated genes were involved in the 
citrate cycle, which is a central metabolic pathway that facilitates the 
adaption of bacteria to environmental stress. This finding is also con- 
sistent with a previous study which showed an increased abundance of 
genes involved in the citrate cycle in E. coli when grown as a biofikn^'. 
The results from that study and ours suggest that pathogens grown as 
a biofilm could still be energetically viable by using amino acids as an 
indirect carbon source through citrate cycle. 

Using the 96-well Biolog GEN III MicroPlate for further pheno- 
typic characterization of the strains, we observed that the cx-D- 
Lactose utilization ability was defective in all three strains as 
compared to the reference strain BAA-2146, and the M strain gained 
the ability to use D-Mannose. Regulation of metabolic pathways has 
been demonstrated to be an important stress response of pathogenic 
bacteria, and a variety of transcriptional regulators such as LysR, 
RutR and Atfl have been shown to play central roles during this 
process™ For example, LysR was shown to be an important reg- 
ulatory component of P. aeruginosa adaptation during oxidative 
stress^^l RutR was demonstrated to be involved in glutamate-depend- 
ent acid resistance of E. coli for survival under acidic conditions'''. In 
addition, many regulatory networks of the pathogen contain 
ncRNAs, which could also regulate the expression of key protein- 
coding genes during the adaptive response to environmental stim- 
^jj36,37 identified a non-synonymous mutation in Kpn2 146_20 1 8 
(P184S), which belongs to the transcriptional regulator LysR family. 
In addition, a couple of mutations were also identified in the inter- 
genic regions near Kpn2146_2018 and Kpn2146_5387, the latter of 
which is also a transcriptional regulator, RutR. We also identified two 
M strain-specific mutations in the intergenic regions between 
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Figure 6 | Distribution of differentially expressed genes in COG functional categories. The y-axis represents the number of genes in each COG category. 



Kpn2146_3273 (RtT RNA) and Kpn2146_3274 (tRNA-Tyr) {T-39A 
and C-37T). Those mutations within or nearby those transcriptional 
regulators and non-coding RNA regions could be associated with the 
regulation of metabolic pathways through up-regulating or down- 
regulating their targets in the K. pneumoniae strains during pathogen 
environmental adaptation. 

The study of genomic variations as a function of pathogen adapta- 
tion to a specific niche could reveal important insights into how they 
sense and respond to varied habitats. Our comparative genomic 
analysis revealed only a few mutations specific to each strain, but a 
large number of mutations were shared by the D, M and T strains, 
suggesting the all three strains underwent adaptation genomic var- 
iations while being cultivated in different environments, though each 
strain were exposed to different combination of stressors and the 
strength of each stressors they encounter might be different. We also 
noticed that a large proportion of the mutations occurred in the 
intergenic regions or ncRNAs, suggesting that those mutations could 
play important regulatory roles on their down-stream genes or more 
distantly located genes, which could be involved in stress response 
and environmental adaptation. There were also some mutations 
occurring in hypothetical proteins such as Kpn2146-1328, 
Kpn2146-2411, and Kpn2146-2412, the roles of which in pathogen 
environmental adaption await further studies. 



Comparative transcriptomic analysis revealed that the gene 
expression patterns of the three strains were significantly different 
from each other. The transcriptomes of both the M and T strains 
indicated significant changes in microbial metabolism compared to 
the D strain. Many more genes were regulated in the T strain com- 
pared to the M strain, suggesting that the T strain experienced more 
stress during spaceflight than the M strain which was cultivated in a 
simulated space condition with 10"' g microgravity. Furthermore, a 
large proportion of the differentially expressed genes in the T strain, as 
compared to both D and M strains, were involved in a variety of 
different metabolic pathways including the following COG categories: 
amino acid transport and metabolism, and carbohydrate transport 
and metaboUsm. These results suggest that in response to envir- 
onmental stress, pathogenic bacteria including highly drug-resistant 
ones exhibit a great flexibility and adaptability to survive successfully 
through regulation of multiple physiological functions and cellular 
pathways, among which the regulation of metabolism pathways 
seems to play a very important role. Using RNA-Seq, we also iden- 
tified a few previously reported or putative ncRNAs, which could play 
critical roles during the environmental adaptation of K. pneumoniae. 
As the RNA isolation procedure used in this study selected against 
small RNA molecules, it is likely that additional small ncRNAs not 
detected here could also be transcribed during stress responses of the 
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K. penumoniae. Nevertheless, the results from this study provide 
powerful evidence showing that RNA-Seq allow quantitative char- 
acterization of bacterial transcriptomes and provide a useful tool for 
exploring transcriptional regulatory networks in bacteria. 

In summary, our comparative genomic and transcriptomic ana- 
lysis of a NDM-1 K. pneumoniae strain being cultivated in different 
conditions support the possible resistance or adaptation of the highly 
drug-resistant pathogenic bacteria towards the extreme environments 
such as the conditions in spacecrafts, which could have important 
impacts on the microbial ecology of the extraterrestrial space. The 
present study could also serve as a basis for future studies examining 
the complex network systems that regulate bacterial adaptation to 
extreme environments. Furthermore, the insights we gained from 
the observations of persistence and pathogenesis-related genomic 
variations and transcriptomic changes of the NDM-1 K. pneumoniae 
strain in response to spaceflight could also facilitate the development 
of novel or alternative therapeutic methods needed to treat the recal- 
citrant infections caused by those highly drug-resistant K. pneumo- 
niae strains. Finally, the data obtained from this study could also 
be important for future infectious disease risk assessment and pre- 
vention during spaceflight missions and in general public health as 
well. 

Methods 

Bacterial strains and culture conditions. The reference K. pneumoniae strain KP 
ATCC BAA- 2146 was used in the study. K. pneumoniae was cultivated in semisolid 
medium (with 0.5% agar) for 15 days at 2VC, and the culture were placed in the 
following conditions: on the ground (D strain), in simulated space condition with 
rotation at 30 rpm and 10"' g microgravity (M strain), and in the Shenzhou 10 
spacecraft {T strain). For phenotypic analysis, the K. pneumoniae strains were 
routinely grown in Luria-Bertani (LB) broth. Agar was added to a final concentration 
of 1.5% when necessary. For growth curve measurement, the strains were grown at 
designated temperature (30^C, 37X or 40X) with shaking (180 rpm). The CDM 
used for growth curve measurement was described previously^'^. 

Drug susceptibility testing. Drug susceptibility testing (DST) for the K. pneumoniae 
strains was performed using the bioMe'rieux VITEK-2 AST-GN13 system following 
manufacturer's instructions as described previously^''. The following 18 drugs were 
tested: ampiciUin (AMP), piperacillin/tazobactam (TZP), ampicillin/ sulbactam 
(SAM), cefazolin (CFZ), ceftriaxone (CRO), ceftazidime (CAZ), cefepime (FEP), 
cefotetan (CTT), ertapenem (FTP), imipenem (IMP), aztreonam (ATM), 
ciprofloxacin (CIP), levofloxacin (LVX), gentamicin (GM), tobramycin (TOB), 
amikacin (AMK), trimethoprim- sulfamethoxazole (SXT), furadantin (FD). The 
ESBLs were detected by the bioMe'rieux VITFK-2 AST-GN13 test. E. coli strains 
ATCC 25922 and ATCC 35218, K. pneumoniae strain ATCC 700603 and P. 
aeruginosa strain ATCC 27853 were used as quality control strains for the DST. 

H2O2 sensitivity assays. To measure the susceptibility of K. pneumoniae to oxidative 
stress, H2O2 sensitivity assays were conducted according to the method reported by 
Cumley et al. with minor modifications'^. Briefly, overnight- grown bacteria were 
diluted 100 folds in fresh LB medium and grown to early stationary phase at 37^C 
with vigorous shaking. Bacteria were resuspended in 0.1 M sodium phosphate buffer 
(pH 7.4) with increasing concentrations of H2O2 (0 mM, 1 mM, 5 mM) without 
agitation. After 2 h incubation at 37^C, cultures were diluted and plated onto LB 
plates. All experiments were repeated at least three times. 

Biofilm formation assay. Bacteria were diluted 100 folds in LB broth, and was then 
inoculated into each well of a 96-well polystyrene flat-bottom microtiter plates and 
statically incubated at 37 C for 24 h. Then the wells were washed with phosphate- 
buffered saline (PBS) to remove unattached cells. Crystal violet (0. 1 % w/v; Sigma) was 
used to stain the attached cells for 30 min at 25^C. The plates were then washed with 
PBS and left to dry for a further 10 min, and the stained biomass was solubilized in 1% 
(w/v) SDS. The absorbance of each well was determined at OD595 nm- All experiments 
were repeated at least three times. 

Scanning electron microscopy. Bacterial cells, grown in LB medium, were fixed with 
2.5% (vol/vol) glutaraldehyde in phosphate -buffered sahne (PBS, pH 7.4) and 
subsequently post-fixed in 1% (wt/vol) osmium tetroxide. The samples were then 
dehydrated in acetone, critical- point dried, and coated with gold-palladium. The 
specimens were finally examined with a FEI Quanta 200 SFM scanning electron 
microscope (USA). 

Carbon source utilization assays and chemical sensitivity assays. The Biolog GEN 
III MicroPlate was used to analyze the K. pneumoniae strains in 94 phenotypic tests. 



which including 71 carbon source utilization assays and 23 chemical sensitivity 
assays. Briefly, the bacterial culture were picked up from the surface of the BUG-FB 
agar plate (Biolog, CA, USA) using a sterile cotton-tipped swab and inoculated into 
the IF-A Inoculum (Biolog, CA, USA). The target cell density of Inoculum was set to 
90-98% T by turbidimeter (BioMerieux, Lyon, Fance). Then 100 |j.l of the inoculum 
was added into each weU of the 96 GEN III MicroPlate™ (Biolog, CA, USA). After 
incubating the culture for 24 hours at 37''C, the OD590 readings were measured with a 
BIOLOG microplate reader automatically and further confirmed visually. 

Genome sequencing. Genome sequencing was performed by Beijing Genomics 
Institute (BGI, China). Briefly, the genomic DNA for each bacterium was prepared by 
conventional phenol-chloroform extraction methods. A 500 bp paired-end library 
was constructed for each purified DNA sample following the standard lUumina 
paired-end protocol with a low-cycle polymerase chain reaction during the fragment 
enrichment, and sequencing was performed on the lUumina Hiseq 2000 with 90 
cycles. Low quality reads were filtered using the DynamicTrim and LengthSort Perl 
scripts within SolexaQA. Short reads were assembled using SOAPc/enovo version 
2.04^ and the gaps were closed by GapCloser version 1.12. 

Genetic mutations detection and phylogenetic analysis. First, the short reads were 
aligned onto the K. pneumoniae ATCC BAA -2146 genome reference using the 
SOAP2 program"*^ Second, SOAPsnp was used to score SNPs from aligned reads'*^. 
SOAPsnp results were filtered as follows: (1) The read coverage of the SNP site was 
more than three, (2) The Illumina quality score of either allele was more than 30, and 
(3) The count of all mapped best base was more than two times the count of all 
mapped second best base. In addition, BWA 0.6.2 and SAMtools 0.1.18 were used to 
confirm the SNP results. The Illumina reads were first aligned by BWA with default 
parameters for each sample. The aligned results were piped to SAMtools to perform 
SNP and Indel analysis. 

Electronic PGR for MLST genotyping and identification of drug resistance- 
associated genes. MLST with seven genes {gap A, infB, mdh, pgi, phoE, rpoB and tonB) 
was performed on isolates according to the protocol described on the K. pneumoniae 
MLST website (www.pasteur.fr/mlst). STs were assigned by using the MLST database 
(www.pasteur.fr/mlst/Kpneumoniae.html). Electronic PGR was used to extract the 
drug resistance-associated genes, then the DNA sequences were annotated using the 
BLAST program at http://www.ncbi.nlm.nih.gov.Mutations in the ^^rA and parC 
genes were identified by comparing the DNA sequences with gyrA and parC 
sequences of the K. pneumoniae (GenBank accession numbers DQ673325 and 
NC009648 for gyrA and parC respectively). 

RNA-Seq and comparative transcriptomic data analysis. Total RNA isolation as 
well as construction and sequencing of cDNA libraries of the K. pneumoniae strains 
were conducted by BGI (China). Briefly, total RNA samples were isolated using a 
RNeasy Protect Bacteria Mini Kit (QIAGEN, Germany) according to the 
manufacturer's instructions. Sequencing was carried out by running 90 cycles on the 
Illumina Hiseq 2000. Paired-end reads were mapped to K. pneumoniae ATCC BAA - 
2146 (CP006659) reference sequence using Bowtie2^". The number of reads mapped 
to each gene was counted by using HTSeq-count (http://www-huber.embl.de/users/ 
anders/HTSeq). The edgeR package^^ from R/BioConductor was used to normalize 
the mapped count data and for differential gene expression analysis. Fold changes 
with FDR < 0.001 were considered to be statistically significant. RPKM (Reads Per kb 
per Million reads) values were provided to enable comparison of relative transcript 
abundance among difference samples. 

Functional annotation and enrichment analysis. KOBAS 2.0 (KEGG Orthology 
Based Annotation System) was used to identify metabolic pathways and to calculate 
the statistical significance of each pathway^^. The COG annotation was performed 
using the Blastall software against the Cluster of Orthologus Groups (COG) database. 
COG enrichment analysis was determined by comparing the prevalence of 
differentially expressed genes assigned to a specific COG category to the prevalence of 
genes in the whole genome assigned to that COG category with a Fisher's exact test. 

Quantitative RT-PCR of selected targets. To validate whether RNA-Seq provides 
reliable quantitative estimates of transcript levels, qRT-PGR analysis was conducted 
for 10 randomly selected genes. cDNA samples were analyzed by quantitative PGR 
with KAPA SYBR FAST qPCR Kit (KAPA Biosystems) on ABI 7300 system (AppUed 
Biosystems). Data were analyzed by the 2"^'^'^^ method and normalized to the gapdh. 
Each experiment was performed in triplicates and repeated at least three times. 
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